SYSTEM, METHOD AND PROGRAM FOR 
PRINTING AN ELECTRONIC DOCUMENT 



BACKGROUND OF THE INVENTION 

1. Field of the Invention 

[000 1] This invention relates to a technology for printing processing in 
which an electronic document (Web document) described in a markup language 
premised on viewing by a Web viewer is downloaded via a network such as the 
Internet and printed onto sheet material having a proper style as a booklet or a 
book. 

2. Description of the Related Art 

[0002] To download and print an electronic document stored on the 
network needs hardware requirements to be enable a continuous connection or 
dialup connection to the Internet. When the electronic document is stored as a 
file (HTML file) described in HTML (HyperText Markup Language), one Web 
page of the electronic document displayed on the screen by the Web viewer may 
include some links to files. Accordingly, the printing processing of the displayed 
page requires a downloading of the linked files in the main HTML file after the 
downloading the main file of the displayed page. 

[0003] In case the files forming the electronic document are temporarily 
stored in a local area of a computer before printing operation, a user needs to 
modify the absolute description of links in the main file to the location of storing 
the files in the local £u:ea. Further, the direct printing of the electronic 
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docximent displayed by the Web viewer causes some problems: Printed 
navigation icons such as a previous-page button icon, a next-page button icon, a 
top-page button icon, and navigation banner, which are necessary in browsing 
operation but not necessary in printed material, reduce density of information 
in printing page unit. 

[0004] In order to shorten the time of printing an electronic document 
described in HTML, Japanese Unexamined Patent Application Publication No. 
11-25156 (Related art 1) describes the method in which a system downloads the 
files forming the Web page, automatically embeds the description of the linked 
file (the linked page) at the point of the link anchor of the main file (the main 
page), and print the new edited file. According to the method disclosed in the 
related art 1, the method may solve such an inconvenience on the operation of 
downloading the linked page again and correcting the description of the link in 
the original page by manual. The conventional method may also reduce a blank 
space of each page at some degree in printed sheets. However, in the method 
disclosed in the related art 1, the description of the linked page is simply 
inserted to the position in the original main HTML file aiid the processed main 
page is simply and continuously printed. Therefore, according to the method 
disclosed in the related art 1, it may be difficult that a printing processing such 
Web pages does not supply a printed material having the proper style of a 
booklet or book. 



SUMMARY OF THE INVENTION 



[0005] It is one object of the present invention to provide a system, a 
method and a computer program for printing such a downloaded electronic 
document, for example a online help fQe or an online manual file, promised on 
viewing by a Web viewer onto sheet material to obtain a real printed material 
having the proper style of a booklet or a book. 

[0006] Specifically, it is the object of the present invention to provide a 
system, a method and a computer program for printing a downloaded electronic 
document as a real printed material having a front cover printed such the title 
of the document, the table of contents and so on; or a header, a footer, or both of 
them printed the title of a chapter printed on pages. 

[0007] Further, it is another object of the present invention to provide a 
system, a method and a computer program for printing any part selected 
arbitrarily by a user from a downloaded electronic document as a real printed 
material having the proper style of a booklet or a book. 

[0008] In order to accomplish the above objects, according to one aspect of 
the present invention, there is provided a system for printing processing an 
electronic document described in a markup language comprising analysis 
defining information storing means for storing analysis defining information for 
analysis which is determined based on a description rule and a template used 
for describing an electronic document in the markup language; cover file 
generating means for extracting cover information displayed on a cover of the 
electronic document based on tag information on a tag and a directive described 
in the document file and for generating a cover file of the electronic document; 
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file analyzing and converting means for analyzing the description of the 
document file based on the analysis defining information, for deleting 
information unnecessary for printing of the document from the document file, 
for analyzing a document structure of the electronic document, for dividing the 
document file on item unit generating the document structure in accordance 
with the analyzed document structure, and for generating a converted file; and 
printing processing means for printing a document with the cover based on the 
cover file and the converted file in accordance with a request for printing the 
electronic document. 

[0009] According to the above-mentioned invention, the system accesses 
an electronic document server for managing an electronic document via a 
network such as the Internet, and obtains document files in a markup language 
(e.g., HTML) forming one electronic document. Further, the system previously 
stores analysis defining information for analysis which is determined based on 
a description nxle and a template used for describing the electronic document in 
the markup language. Then, the system extracts cover information for 
displayed on a cover of the electronic document based on tag information on a 
tag and a directive described in the document file, and generates a cover file of 
the electronic document. Then, the system analyzes the description of the 
document file based on the analysis defining information, and deletes 
information unnecessary for printing of the document firom the document file. 
Then, a system analyzes a document structure of the electronic document, 
divides the document file on item xmit forming the document structure in 
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accordance with the analyzed document structixre, and generates a converted 
file. After that, the system prints a document with the cover based on the cover 
file and the converted file in accordance with a request for printing the 
electronic document. 

[00 10] In addition, according to the present invention, the system, upon 
receiving a request for printing-with-cover indicating to print the electronic 
document with the cover, prints the document with the cover printed such cover 
information like the title of document, the table of contents, a copyrights notice 
based on the cover file and the converted. On the other hand, on receiving a 
request for printing- without-cover indicating to print the electronic document 
without the cover fi^om a user, the system prints the documents with the cover 
information to a header part, a footer part, or both the header and the footer 
parts. 

[00 11] Further, according to the present invention, the system generates 
contents structiire information (structured-contents structure) for generating a 
table of contents of the electronic document and information (file-corresponding 
information) having a corresponding relationship between items of the table of 
contents and the converted file based on the analyzed document structure, and 
displays the table of contents in which the display or not-display of classes in 
the document structure is d3niamically controlled, based on the content 
structure information and the corresponding information, receives the item 
selected for printing fi'om the table of contents, and prints the document 
including the selected item with the cover on the basis of the converted file 
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corresponding to the selected item in accordance with a request of the electronic 
document. 

[0012] As mentioned above, according to the present invention, it is 
possible to print the downloaded electronic document as a real document having 
an appearance of a booklet or a book, irrespective of the connection with the 
Internet of the computer terminal. In particular, it is possible to print the 
electronic document having the front cover printed necessary items such as a 
title of a document, a document code, and a copyright notice. 
[0013] Further, according to the present invention, it is possible to print 
the electronic document having necessary items such as a title of a chapter or 
section and a copyright notice printed in a header part, a footer part, or both of 
them on each page. 

[0014] Furthermore, according to the present invention, it is possible to 
display, on a screen of a computer, a table of structured contents generated from 
the files forming an electronic document and to print only the items selected 
arbitrarily by a user while viewing the table of structured contents and some 
contents of the electronic document. 



BRIEF DESCRIPTION OF THE DRAWINGS 

[0015] FlG. 1 is a block diagram showing an example of a configuration of 
an electronic document printing system according to the present invention; 

FIG. 2 is a diagram showing an example of a display screen of a page 
viewer for displaying a page as a printing target in the electronic document; 
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FIG. 3 to FIG. 5 are diagrams showing parts of source of an HTML file of 
a body text page as shovm in FIG. 2; 

FIG. 6 to FIG. 8 are diagrams showing a source of a converted file; 

FIG. 9 is a diagram showing an example of a display screen of a table of 
structured contents and a page of the converted file; 

FIG. 10 is a diagram showing images of a real document as a printing 
result; 

FIG. 11 is a flowchart for file obtaining processing; 

FIG. 12 to FIG. 14 are diagrams showing parts of a source of the 
downloaded file "index.html" so as to describe the file obtaining processing; 

FIG. 15 is a flowchart for file analyzing and converting processing; 

FIG. 16 is a diagram showing a part of a source of the HTML file so as to 
describe the downloading operation in data analyzing and converting 
processing; 

FIG. 17 is a diagram showing examples of tags as an analyzed target; 
FIG. 18 is a diagram showing examples of tags added in the conversion; 
FIG. 19 is a diagram showing examples of items (modules) forming a 
table of structured contents structure; 

FIG. 20 is a flowchart for structured-contents display processing; 
FIG. 21 is a diagram showing examples of display icons; 
FIG. 22 is a flowchart for printing processing; 
FIG. 23 is a flowchart for cover forming processing; 
FIG. 24 is a flowchart for preview processing; and 
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FIG. 25 is a diagram showing examples of functions (modules) used for a 
computer program realizing the electronic document printing system. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 

[0016] FIG. 1 is a diagram showing an example of an electronic document 
printing system according to the present invention. An electronic document 
printing system 1 connects to an electronic docimient server 3 via a network 2. 
The electronic document server 3 manages an electronic document 4 premised 
on being displayed by a Web viewer and let the electronic document 4 
downloads based on a user's request. The electronic document 4 is an HTML 
file described in HTML. The electronic document printing system 1 has a file 
obtaining unit 10, a file analyzing and converting unit 20, and a printing 
processing unit 40. 

[0017] The file obtaining unit 10 downloads the electronic document 4 via 
the network 2 from the electronic document server 3. The file obtaining unit 10 
has a downloaded-file determining unit 11 and a downloading unit 12. The 
downloaded-file determining unit 11 determines a plurality of files forming one 
electronic document 4 as a target for printing based on a URL (Uniform 
Resource Locator) assigned by for example a user. The downloading unit 12 
downloads the files determined as a target for downloading by the downloaded- 
file determining unit 11 firom the URL of the files forming the electronic 
document 4. Further, the downloading unit 12 stores the downloaded files in 
the electronic document printing system 1. The downloaded files of the 
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electronic document 4 are assumed as an HTML files 51. 

[0018] The file analyzing and converting unit 20 analyzes the description 
of each file of the HTML files 51 downloaded by the file obtaining unit 10 on the 
basis of analysis defining information 50. Then, the file analyzing and 
converting unit 20 converts the HTML files 51 to a converted file 53 the 
description of which is converted to be suitable for printing onto sheet material 
and forms a cover file 52 based on the HTML files 51. The file analyzing and 
converting unit 20 has a cover-file generating unit 21, an analyzing and 
converting unit 22, and a contents-structure generating unit 23. 
[0019] The analysis defining information 50 is information for the analysis 
of tags as targets in the analyzing processing, the determination rule used in 
the analyzing processing, and addition of tags in the converting processing, the 
information of which is determined on the basis of the template or the 
description rule used upon describing the electronic document 4 in HTML. The 
electronic document printing system 1 stores the analysis defining information 
50 predetermined by the structural characteristic of the HTML file expected to 
download. 

[0020] The cover^file generating unit 21 generates the cover file 52 fi^om 
the file for the top page in the HTML files 51 downloaded by the file obtaining 
unit 10. The front cover of a document 6 is printed based on the cover file 52 in 
the printing processing. The analyzing and converting unit 22 analyzes the 
description of each file of the HTML files 51 on the basis of the analysis defining 
information 50 and converts the HTML files 51 so that the HTML files 51 be 
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suitable for the printing onto sheet materiad. Specificsdly, the analyzing and 
converting unit 22 analyzes the description of each file of the HTML files 51 and 
downloads a HTML file or an image file referred in a body text of the document. 
Further, the analyzing and converting unit 22 deletes information unnecessary 
in the printed document 6. 

[0021] Furthermore, the analyzing and converting unit 22 analyzes a 
structure of contents of the electronic document 4 based on the description of 
the HTML files 51. Then, the analyzing and converting unit 22 divides one file 
of the HTML files 51 into such an item unit like "chapter", "section" and "unit" 
of the document structure and stores the converted files 53 in the electronic 
document printing system 1 through the analyzing and converting processing. 
[0022] Further, based on the result of the analyzing processing, the 
analyzing and converting unit 22 stores file-corresponding information 54, 
which is a corresponding relationship between a hierarchical structure of the 
converted files 53 divided according to the structure of the electronic document 
4 and the converted files 53 used for obtaining each title of items in the table of 
structured contents. The contents-structure generating unit 23 forms a 
structured-contents structure 55 used for managing the table of structured 
contents displayed by printing processing unit 40 on the basis of the analyzing 
result of the analyzing and converting unit 22. The structured-contents 
structure 55 is information on the structure of the electronic document 4, 
namely, a hierarchical relationship of the converted files 53. 
[0023] The printing processing unit 40 prints the contents of the HTML 
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files 51 as a real printed material having the proper style of a booklet or a book. 
Further, the printing, processing unit 40 has a structured-contents displaying 
unit 41, a document displaying unit 42, a printing item obtaining unit 43, a 
printing unit 44, and a previewing unit 45. 

[0024] The structured-contents displaying unit 41 displays the table of 
structured contents indicating the hierarchical relationship with the converted 
files 53 in tree structure on the Web viewer (not shown in FIG.l). In the table 
of structured contents, each file of the converted files 53 corresponds to the item 
unit such as "chapter, section, and item" of the document structure of the 
electronic document 4. The structured-contents displajdng unit 41 dynamically 
controls the display or not-display status of the item of the table of structured 
contents by such a user's operation like a click. The structured-contents 
displaying unit 41 can additionally displays the icon set by the predetermined 
file category of the file of the converted files 53 corresponding to the item of the 
table of structured contents. 

[0025] The document displa3ring unit 42 displays, on the Web viewer, the 
content of the file determined as the head item of the table of structured 
contents or the content of the converted files 53 corresponding to the item 
selected of the table of structured contents. The printing item obtaining unit 43 
obtains the item selected as a target of printing in the table of structured 
contents displayed on the Web viewer. The printing unit 44 prints the item 
obtained by the printing item obtaining unit 43 onto sheet material having the 
proper style of a booklet or a book as the document 6. The previewing unit 45 
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displays the contents of the item of the document 6 set for printing. 

[0026] The general description of processing of the electronic document 

printing system 1 is given below. 

[0027] It is assumed that the electronic docimient 4 downloaded by the file 
obtaining unit 10 in the electronic document printing system 1 is an online 
manual of a personal computer, which manual is described in HTML. FIG. 2 
shows an example of a display screen 100 of a Web viewer for displaying one 
page as a target of printing in the electronic document 4. FIG. 3 to FIG. 5 show 
examples of parts of source of the HTML file of a body text of the page displayed 
on the display screen 100. The parts of source shown in FIG. 3 to FIG. 5 are of 
one HTML file, FIG. 3 shows a head part, FIG. 4 shows an intermediate part, 
and FIG. 5 shows the end part. 

[0028] Generally, as shown on the display screen 100, the Web page in the 
electronic document 4 is displayed on the Web viewer based on the description 
of the HTML file interpreted for display and comprised of a contents 110 
indicating the table of contents and a body text 120 indicating the contents of 
the electronic document 4. The page as the body text 120 is included a 
navigation button 130 as a part of the electronic document 4 and the contents of 
the HTML file for display, and the contents of body text 120 is combined and 
obtained from plural classes of the structure of the document for displaying. As 
indicated by the contents 110, an item "1. Name and function of unit" in the 
body text 120 denotes the content belonging to the class direct below "Chapter 
1: Name of unit" of the first class, namely, the item in the second class (section). 
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Fxirther, an item "personsJ computer main body (hereinafter, referred to as a PC 
main body)" and an image 140 of the PC main body in the body text 120 denote 
the contents belonging to the class direct below "1. Name and function of unit", 
namely, the items in the third class (item). 

[0029] The file analyzing and converting unit 20 analyzes and converts 
each file of the downloaded HTML files 51 so as not to print the Web page of the 
online manual as exactly shown on the display screen 100 but to print the page 
in a appropriate style for a real printed material (a manual). That is, the file 
analyzing and converting unit 20 generates a cover file 52 for printing the front 
cover of the document 6 to print the electronic document 4 as a proper styled 
printed material. Then, the file analyzing and converting unit 20 deletes 
unnecessary information such as the navigation button 130 incorporated from 
the content in the page of the body text 120. Further, the file analyzing and 
converting unit 20 divides the HTML files 51 corresponding to the body text 120 
into such an item as "chapter, section, or item" in the document structure on the 
item unit basis, and generates the converted files 53 from the divided items. 
Then, the file analyzing and converting unit 20 stores, as file-corresponding 
information 54, information about a hierarchical relationship among the 
divided and generated classes of the converted files 53 as a structured-contents 
structure 55. Specifically, the file analyzing and converting unit 20 analyzes 
that the description of the HTML file 51 includes "1. Name and function of unit 
(Chapter 1, Section 1)" as the item in the second class (section) forming 
"Chapter 1: Name of unit" in the first class (chapter) and "front of personal 
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computer main body (hereinafter, referred to as front of PC main body) 
(Chapter 1, Section 1, Item 1)" as the under-item in the third class (item). Then, 
the file analyzing and converting unit 20 divides the HTML file 51 into the 
items based on the analyzing result, and generates two converted files 53. 
[0030] FIG. 6 to FIG. 8 show the source of the converted files 53. FIG. 6 
shows the source of the file of "1. Name and function of unit" divided from the 
HTML file as items of the second class (section) shown in FIG. 3 to FIG. 5. FIG. 
7 and FIG. 8 show the source of the file of "front of PC main body" as the item in 
the third class (item) divided from the HTML file 51. 

[0031] FIG. 9 shows an example of the display screen 200 displa3ring the 
table of structured contents and the page based on the converted files 53 
subjected to the analyzing and converting processing. On the display screen 
200 are displayed a structured contents 210 indicating, in tree structure, the 
table of structured contents of the electronic document 4 and a body text 220 
indicating the content based on the converted files 53, which is obtained by 
deleting the information unnecessary for the printing and by dividing it on the 
item unit basis. Referring to FIG. 9, one of the converted files 53 in the page 
displayed on the body text 220 is the file indicating "1. Name and function of 
imit" in the items of the second class (section). Thus, as shown the body text 
120 in FIG. 2, the content "front of PC main body" in the third class (item) and 
the image 140 in the PC main body are not displayed simultaneously with the 
content "1. Name and function of unit". 

[0032] The user can select the items for printing from the table of 
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structured contents displayed on the structxired contents 210 of the display 
screen 200. The printing processing unit 40 combines the cover file 52 and the 
converted files 53 corresponding to the selected items into one file for printing 
job, and prints the file thereof. FIG. 10 shows the image of the printed material 
of the document 6 as a printing result. Thus, the electronic document printing 
system 1 can obtain the docviment 6 as the real printed material having the 
appearance of the booklet as shown the image in FIG. 10 from the electronic 
document 4 described as the HTML files 51 premised on displaying on the Web 
viewer as shown on the display screen in FIG. 2. 

[0033] Here, a detailed description of processing of various means in the 
electronic document printing system 1 is given. FIG. 11 shows a flowchart for 
the file obtaining processing. The downloading unit 12 accesses a URL inputted 
by user's operation or other programs' execution, then downloads the HTML file 
having the description of the table of contents of the electronic document 4, for 
example the "index.html" file (step SI). The downloaded-file determining unit 
11 generates a list of the HTML files as targets of downloading from the 
downloaded 'lndex.html" file, and calculates the total number of files to be 
downloaded (step S2). Specifically, the downloaded-file determining unit 11 
extracts the URL which is defined by <href> from a first <hr> tag to a last <hr> 
tag in the "index.html" file, and generates the list of the HTML file to be 
downloaded based on the extracted URL. When the same URL is described a 
plurality of times and is has been overlapped to the URL registered to the list, 
it is ignored. 
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[0034] FIG. 12 to FIG. 14 show parts of the soxirce of one loaded 
"index.html" file. FIG. 12 to FIG. 14 show the source of a continuous portion 
from the head portion of the ' index.html" file. The URLs defined by <href> 
existing fi^om a portion shown by a downward arrow in FIG. 12 to a portion 
shown by an upward arrow in FIG. 14 are URLs of the HTML file as the 
description of the items forming the table of structured contents. The 
downloaded-file determining unit 11 registers the URLs to the list of the 
downloaded file. The downloading unit 12 downloads the HTML file 
constituting the electronic document 4 based on the list of the downloaded file, 
and stores the downloaded files as the HTML files 51 (step S3). 
[0035] FIG. 15 shows a flowchart for the file analyzing and converting 
processing. The file analyzing and converting unit 20 obtains the URLs of the 
files downloaded from the file obtaining unit 10 and a path to a directory 
storing the HTML files 51. The analyzing and converting unit 22 determines 
whether or not the HTML files 51 to be analyzed exists (step Sll). When the 
HTML files 51 to be analyzed exists, the processing sequence after step S12 is 
performed- The analyzing and converting unit 22 downloads the HTML file and 
such a file as an image file defined in the HTML files 51 and stores the obtained 
files (step S12). Specifically, the analyzing and converting unit 22 refers to all 
the HTML files 51 in the downloading order, downloads the HTML file based on 
the URL defined by <href> from the first <hr> tag to the last <hr> tag in the 
HTML file, and further downloads the image file defined by an <img> tag. The 
analyzing and converting unit 22 does not download the same file as the 
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downloaded file to avoid duplicated storing. 

[0036] FIG. 16 shows a part of the source of the HTML files 51 
downloaded by the file obtaining unit 10. For example, the HTML file is 
downloaded with the URL defined based on the <href> tag existing between a 
downward arrow and an upward arrow shown in FIG. 16. 
[0037] The analyzing and converting unit 22 analyzes the description of 
the HTML files 51 based on the analysis defining information 50, deletes the 
unnecessary information, divides the description of the file into each unit of 
"chapter, section, or item" forming the document structure as the converting 
processing. Then, the analyzing and converting unit 22 generates and stores the 
converted file and writes the analyzing result as the file-corresponding 
information 54 (step S13). For example, the analyzing and converting unit 22 
deletes the information described before the existing position of the first <hr> 
tag and the information described after the existing position of the last <hr> tag 
in the HTML files 51 in progress of the analysis. The analyzing and converting 
unit 22 deletes the <href> tag in the HTML files 51. Further, when the HTML 
files 51 includes <hl>, <h2>, and <h3> tags, the analyzing and converting unit 
22 generates one file by the information described before the existing position of 
these tag, and further generates another file by the information described 
sequent thereto. 

[0038] Here, a file name comprising "number string of 12 figures" and a 
".html" extension are set to the converted files 53. The 12-figure portion in the 
file name is equally divided into four portions by three figures 
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("000111222333.html"), and segments the four portions into a first class, a 
second class, a third class, and fourth class, starting fi-om the head. For 
example, the file name is as follow: 

1) The name of the file of "Chapter 1" is "001000000000.html", and the 
file thereof belongs to the first class, 

2) The name of the file of "Chapter 2" is "002000000000.html", and the 
file thereof belongs to the first class, 

3) The name of the file of "Chapter 2, Section 1" is "002001000000.html", 
and the file thereof belongs to the second class, 

4) The name of the file of "Chapter 2, Section 2" is "002002000000.html", 
and the file thereof belongs to the second class, 

5) The name of the file of "Chapter 2, Section 2, Item 1" is 
"002002001000.html", and the file thereof belongs to the third class, and 

6) The name of the file of "Chapter 3" is "003000000000.html" and the file 
thereof belongs to the first class. 

[0039] The URLs (file names) of the HTML file are registered in the 
structured order to the file-corresponding information 54 so as to obtain the 
titles indicating the items in the table of structured contents. 
[0040] Next, the cover-file generating unit 21 generates the cover file 
(HTML file) 52 based on data in the downloaded HTML files 51 by the 
downloading unit 12 (step S14). For example, the cover-file generating unit 21 
extracts the manual name, the manual code, and information on the copyright 
notice to be described on the fi^'ont cover, from the HTML files 51, for example a 
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"top.html" file, forming the top page in the electronic document 4, and generates 
and stores the cover file 52. 

[0041] Specifically, the cover-file generating unit 21 generates the cover 
file 52 as follows: 

1) The cover-file generating unit 21 determines a linefeed code and 
obtains a character string, on line unit basis, firom the description of the target 
HTML file 51 such as the "top.html" file, 

2) The cover-file generating unit 21 checks whether or not characters 
other than the HTML tag exists in the obtained character string and obtains 
characters other than the HTML tag, 

3) If the above procedure is performed as the first one, the cover-file 
generating unit 21 sets the obtained characters (character string) other than 
the HTML tag as "manual name", 

4) If the above procedure is performed as the second one, the cover-file 
generating unit 21 sets the obtained characters (character string) other than 
the HTML tag as "manual code", 

5) The cover-file generating unit 21 checks whether or not the obtained 
character string includes characters such as "copyright", and then, in case the 
characters of "copjrright" exists in the obtained characters, sets the obtained 
characters (character string) as the copyright notice, and 

6) The cover-file generating unit 21 insert the character strings set as the 
manual name, manual code, and copyright notice into the predetermined 
template of the fi:-ont cover sheet as the HTML file, thereby generates the cover 
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file 52. 

[0042] The analysis defining information 50 used for the analysis and 
conversion processing is information defining the tag as the targets of analyzing 
or adding in the converted files 53. 

FIG. 17 shows examples of the tag as a target of analyzing which is 
defined by the analysis defining information. FIG. 18 shows examples of the 
tag added in the converting processing. Referring to FIG. 17, <hl>, <h2>, and 
<h3> tags are used for the determination upon dividing the file. The <hr> tag 
is used for the determination upon deleting the unnecessary information. An 
<href> tag is used for the determination upon downloading the HTML file. An 
<src img> tag is used for the determination upon downloading an image file. A 
<title> tag is used when the title information of the HTML file is necessary. 
The <title> tag is used upon obtaining a character string (title) registered to 
item in the table of structured contents or upon obtaining the character string 
of the item selected when the front cover page is formed in the printing 
processing or the preview processing. 

[0043] An <meta http-equiv= "Content-TVpe" content= "text/html; 
charset=iso-2022-jp"> tag is inserted into the header portion of the converted 
file (HTML file) 53 which is generated through the analyzing processing. An 
<STYLE TYPE="text/css">H6{page-break-before:always}</STYLE> tag is 
inserted into the header portion of the HTML file for printing job in the printing 
processing or the preview. processing. In case the tag is inserted and an <H6> 
tag is described in the HTML file, the page-break is executed at the position of 
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the tag described. The <H6> tag is described the position set the page-break 
command in the HTML file for printing job in the printing processing or the 
preview processing. 

[0044] Referring to FIG. 18, an <NAME>***</NAME> tag is used in the 
header portion in the cover file 52. A character string indicating a manual 
name (document name) is described as a directive between the tags. A 
<CODE>***</CODE> tag is used in the header portion in the cover file 52. A 
character string indicating the manual code (document code) is described as a 
directive between the tags. A <COPYRIGHT>***</COPYRIGHT> tag is 
used in the header portion in the cover file 52. A character string indicating the 
copyright notice is described as a directive between the tags. A <K*> 
</K*> tag indicates which class the HTML file belongs to. The tag information 
is used upon changing the displayed class of the item corresponding to the 
HTML file 51. For example, a set of <K1> </Kl> tags indicates the first class 
and a set of <K2> </K2> tags indicates the second class. 
[0045] A <BASEFILENAME="***"> tag indicates the file name of the 
converted files 53 after the completion of the analyzing and converting 
processing. The tag information is used upon changing the displayed class, an 
<SUBTITLE="***"> tag indicates the title of the converted files 53. The 
information of this tag is used upon changing the displayed class, a <viewdocs> 
<K*> <BASEFILENAME="***"> <SUBTITLE="***"> </K*> </viewdocs> tag is 
inserted into the top of the body portion in the converted files 53 generated in 
the analyzing and converting processing. 
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[0046] Fiirther, the contents-structure generating imit 23 generates the 
structured-contents structure 55 based on the file-corresponding information 54 
(stepSlS). 

[0047] The structured-contents structure 55 is information for managing 
the configuration of the table of structured contents. FIG. 19 shows examples of 
modules comprising the structured-contents structure 55. The structured- 
contents structure 55 comprises the following modules. An "HTREEITEM 
hTreeltem" module is a tree handle, for managing the table of structured 
contents, transmitted upon registering an item to the table of structured 
contents. The tree handle is obtained by a return value of an "InsertltemO" 
function which is used upon registering the item to the table of structured- 
contents. A "CString strTitle" module indicates the title displayed as an item in 
the table of structured contents. An ' 'int iRoot" module indicates a flag for 
determining whether or not an item belongs to a root class (first class). If the 
value of the module equals zero (=0), the module indicates the item belongs to a 
child class, namely, the second or subordinated class. If the value of the module 
equals one (= 1), the module indicates the item belongs to the root, namely, the 
first class. The "int iRootCnt" module indicates the registered number of the 
mother item to the table of structured contents. An 'int iDeep" module is a flag 
for determining which class an item belongs to. If the value of the module 
equals one (= 1), the module indicates the item belongs to the first class. If the 
value of the module equals two (= 2), the module indicates the item belongs to 
the second class. If the value of the module equals three (= 3), the module 
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indicates the item belongs to the third class. If the value the module equals 
four (= 4), the module indicates the item belongs to the fourth class. A "CString 
strPath" module indicates path information of the registered converted-file 53. 
A "bool bCheckOn" module is a flag for determining whether or not the item is 
selected on the table of structured contents. If the value of the module is TRUE, 
module indicates the check box of the table of structured contents is ON and the 
item selected. If the value of the module is FALSE, the module indicated the 
check box of the table of structured contents is OFF and the item not-selected. 
[0048] The contents-structure generating unit 23 obtains information on 
class relationship between the files based on the file name having 12-figure 
numbers assigned to each file of converted file 53, and sets the obtained 
information to the structured-contents structure 55. 

[0049] FIG. 20 shows a flowchart for the structured-contents display 
processing. The structured-contents display processing is executed by the 
structured-contents displaying unit 41 in the printing processing unit 40. The 
structured-contents displaying unit 41 displays the table of structured contents 
based on the information set to the structured-contents structure 55 (step S21). 
Specifically, the structured-contents displaying unit 41 extracts the item of the 
first class from the structured-contents structure 55, obtains the file name of 
the converted files 53 corresponding to the extracted item from the file- 
corresponding information 54. Then, the structured-contents displaying unit 41 
extracts the character string (title) of the <TITLE> tag in the obtained 
converted-file 53, and displays, in tree structure, the title extracted on the 
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structured contents 210 of the display screen 200. Thus, the title of each 
chapter as the item of the first class is displayed as a default of the table of 
structured contents. 

[0050] The. structxired-contents displaying unit 41 displays the title of the 
table of structured contents and the icon corresponding to the type of item 
description. For example, the structured-contents displaying unit 41 calculates 
the appearance ratio of tags of the same kind from the first <hr> tag to the last 
<hr> tag in the converted files 53 corresponding to the item to be displayed. If 
a predetermined tag appears by a ratio of 90% or more, the icon representing to 
the tag is displayed. 

[0051] FIG. 21 shows examples of the icon. Referring to FIG. 21, if the 
<table> tag appears in the converted files 53 at a predetermined ratio, a table 
icon is displayed on the item of the table of structured contents. If the <img> 
tag appears in the converted files 53 at a predetermined ratio, an image icon is 
displayed. If the <a> tag appears in the converted files 53 at a predetermined 
ratio, a link icon is displayed. If the three tags do not satisfy the predetermined 
appearance ratio, a book icon is displayed. 

[0052] The document displaying unit 42 displays, on the body text 220 on 
the display screen 200, the description of the converted files 53 (HTML file) 
registered to the head item of the table of structured contents (step S22). 
[0053] FIG. 22 shows a flowchart for the printing processing. The printing 
processing is executed by the printing item obtaining unit 43 and the printing 
unit 44 in the printing processing unit 40. The printing item obtaining unit 43 
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obtains the item selected from the table of structured contents and the setting 
of presence or absence of the cover printing (step S31). The printing item is 
selected by user's marking with the check box of the item displayed on the 
structured contents 210 of the display screen 200, The presence or absence of 
the cover printing is determined by selecting the menu item for the cover 
printing operation set in the operation designating menu portion (menu bar) or 
the check box set in the cover printing operation window (not shown) on the 
display screen 200. Then, the user selects the menu item or marks with a check 
on the check box to set an indication of the printing the front cover. The 
printing item obtaining unit 43 determines whether the front cover is present or 
absent (step S32). If "presence of the front cover" is determined in step S32, the 
cover generating processing is performed (step S33). 

[0054] FIG. 23 shows a flowchart for the cover generating processing. The 
printing unit 44 generates the file for printing job in a work area of a memory 
unit, and copies the description of the template of the cover file (step S40). 
Further, the printing unit 44 determines whether or not the item exists in the 
table of structured contents based on the structured-contents structure 55 (step 

541) . When the item exists, the items of the table of structured contents are 
fetched one by one and determined whether or not the item is selected (step 

542) . If the item of the table of structured contents is selected by referring to 
the value of the "bCheckOn" flag, the printing unit 44 generates a temporary 
structure in the work area of the memory unit. Then, the printing unit 44 
determines whether or not a counter value of the total number of titles stored in 
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the temporary structure is not more than a predetermined number (for example 
the value of 15) (step S43). 

[0055] When the counter value of the total of titles stored in the temporary 
structure is not more than the predetermined number, the printing unit 44 
obtains and stores the title of the item selected on the table of structured 
contents into the temporary structure. After that, the counter value of the total 
number of titles is counted up (step S44). Specifically, the printing unit 44 
obtains the file name of the converted files 53 corresponding to the extracted 
item from the file-corresponding information 54. Then, the printing unit 44 
extracts and stores the character string (title) of the <title> tag in the obtained 
converted file 53 into the temporary structure, and increments the counter 
value of the total number of titles by one. Upon checking the class of the 
selected item and determining that the items belong to plural classes, the 
printing unit 44 stores only the title of the item of the upper class. Further, the 
titles of all the selected items may sequentially be stored. 

[0056] • If the table of structured contents includLes the selected item which 
is not processed yet, the printing unit 44 performs the processing in steps S43 
and S44 during the term the counter value of the total number of titles does not 
reach the predetermined number. After the counter value of the total number of 
titles reaches the predetermined number, the printing unit 44 stores the 
character string "others... " at the end of the titles in the temporary structure 
(step S45). After that, the printing unit 44 determines the character string 
from the cover file 52 by linefeed code and obtains it by one line unit (step S46) 
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and determines whether or not the <a name=f "item"> tag exists in the obtained 
character string (step S48) through the end of the file (step S47). When the <a 
name="item"> tag exists in the obtained character string, the printing unit 44 
inserts the title of the item stored in the temporary structure into the file for 
printing job (step S49). 

[0057] When the <a name="item"> tag does not exist in the character 
string, the printing unit 44 obtains the character string by one-line unit (step 
S46), and repeats the processing in steps S48 and S49 through the end of the 
file (step S47). 

[0058] After performing the cover generating processing (FIG. 22: step 
S33), the printing unit 44 extracts the converted files 53 corresponding to the 
selected item in the order set in the file-corresponding information 54, and 
combines the converted files 53 in the order for extraction to the file for printing 
job (step S34). It is determined whether the presence or absence of the front 
cover upon printing (step S32). If "the absence of a front cover" is determined, 
the printing unit 44 extracts the converted files 53 corresponding to the selected 
item in the order set in the file-corresponding information 54 and combines the 
converted files 53 into the file for printing job in the extracting order (step S35). 
[0059] The printing unit 44 adds a header part, a footer part or both of 
them and the page number to each pages except the firont cover of the file for 
printing job, and then prints the file for printing job onto sheets of paper so as 
to output the document 6 (step S36). For example, the printing unit 44 inserts 
the character strings of the title of the online manual and each title of items to 
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the header part and the page number and the copyright notice to the footer part 
of the page for printing. 

[0060] FIG. 24 shows a flowchart for the preview processing. The preview 
processing is executed by the previewing unit 45 in the printing processing unit 
40. The previewing unit 45 determines whether or not the preview operation is 
selected (step S51). If the preview operation is selected, the previewing unit 45 
obtains the file for printing job from the printing unit 44 (step S52). Then, the 
previewing unit 45 add the header part, the footer part, or both of them, and 
the page number to each page except the front cover of the file for printing job, 
and displays the contents of the file on a preview screen (not shown) (step S53). 
The previewing unit 45 closes and ends the preview screen when either return 
or end button is selected by click on the preview screen (step S54). 
[0061] According to the present invention, the processing unit, functions, 
or components are realized as a processing program which is read and is 
executed by a computer. Further, the processing program according to the 
present invention is stored in a proper recording medium which are read by a 
computer, such as a movable medium memory, semiconductor memory, or hard 
disk. The processing program is recorded to these recording medium and is 
provided. Alternatively, the processing program is provided by the reception 
and transmission using various communication networks via a communication 
interface. 

[0062] FIG. 25 shows examples of the functions (modules) used by the 
processing program when the electronic document printing system is realized 
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by the computer-readable processing program. The description of the functions 
are given below: 

1) "OnSearchO * of "FindKyeword.cpp" is a function for starting a search 
processing, 

2) "MySetTreeViewStatusO" of "HTMLPrint.cpp" is a function for storing 
a value in structured contents structure 55 and 
"MyUnionHtmlFileO" is a function for combining the HTML files, 

3) "OnOptKailO" of "HTMLPrintDoc.cpp" is a function for selecting 
"Display up to the first class", "OnOptKai2()" is a function for selecting "Display 
up to the second class", "OnOptKaiSO" is a function for selecting "Display up to 
the third class", "OnOptKai4()" is a function for selecting "Display up to the 
fourth class", "OnFileOpenWebO" is a function for selecting "Open Web manual" 
in the file menu and "OnFileOpenManu" is a function for selecting "Open 
Manual" in the file menu, 

4) "SetTreeViewStatusO" of "MainTreeView.cpp" is a function for selecting 
an item of the table of structured contents, "OnSelchangedO" is a function for 
displaying the file of the selected item, "OnCheckOnAllO" is a function for 
selecting "check all items" in the pop up menu, "OnCheckOfFAllO" is a function 
for selecting "reset all items" in the pop up menu, "OnCheckOnUnderO" is a 
function for selecting "check all items under classes" in the pop up menu, 
"OnCheckOffUnderO" is a function for selecting "reset all items" in the pop up 
menu, "OnCheckOnUnderPageO" is a function for selecting "check all 
subsequent pages" in the pop up menu, "OnCheckOffUnderPageO" is a function 
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for selecting "reset all subsequent pages" in the pop up menu, 
"GetCheckStatusO" is a function for determining on or off of check boxes in the 
entire table of structure contents, "MySetCheckStatusO" is a function for 
determining on or off of check box and setting a flag and "MySelectltemO" is a 
function for setting designated item in the table of structured contents to 
selected state, and 

5) "SetPrintPreviewO" of "MainWebView.cpp" is a function for executing 
printing or previewing processing. 

[0063] According to the present invention, the user obtains an electronic 
document, for example a downloaded online help file or online manual file, as 
printed material having proper style of a booklet or a book, in irrelevant to the 
status of connection with the Internet. That is, according to the present 
invention, the user obtains a downloaded electronic document as printed 
material when the continuous access to the Internet or the dialup access is 
impossible. Further, the user can browse, by the Web viewer, the contents of 
the electronic document selected as a target for printing into the printed 
material and confirms the contents of the file to be printed. 

[0064] Furthermore, according to the present invention, the description of 
the files forming the electronic document are analyzed and converted to a 
description suitable for the printing as sheet material like paper. Thus, as 
compared with the simple continuous-printing of the HTML file of the electronic 
document formed for displajdng, the description of the document to be printed is 
compressed and the number of printed sheets is reduced. 
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[0065] Furthermore, according to the present invention, the user 
arbitrarily selects only the item for printing from the table of contents 
indicating the items generating the electronic document in tree structure, and 
obtains the document including only selected item as printed material with the 
proper form of booklet or a book. 

[0066] Although the present invention is described according to the 
embodiment, it can variously be modified within the essentials of the present 
invention. According to the above-described embodiment of the present 
invention, the electronic document printing system 1 has the file obtaining unit 
10 including the downloading function via the network 2. However, the file 
obtaining unit 10 in the electronic document printing system 1 does not have 
the downloading function, and can read the HTML files 51 stored in an external 
storage device and can perform the file analyzing and converting processing and 
the printing processing, 

[0067] Further, according to the above-described embodiments of the 
present invention, the electronic document printing system 1 includes one 
pieces of the analysis defining information. However, the electronic document 
printing system 1 can have a plxirality of pieces of the analysis defining 
information which define information previously determined by the description 
rule or the template used in the case of describing in the markup language, the 
electronic document corresponding to the various electronic documents as the 
target of the printing processing. 

[0068] The entire disclosure of Japanese Patent Application No. 2003- 
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305028 filed on October 18, 2002 including specification, claims, drawings and 
summary are incorporated herein by reference in its entirety. 
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